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Assistant Commissioner for Patents 



Sir: 
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The undersigned Gudmundur O. Hreggvidsson, an Icelandic citizen, declares and states as 
follows: 



1 . 1 graduated from the University of Edinburgh, Scotland, in 1 993 with a Ph.D. in Molecular 
Biology- 1 was senior scientist at the Biotechnology Department of the Technological 
Institute of Iceland from 1990-1998 and Assistant Professor at the University of Iceland from 
1 999 to present. I am the VP of operations for Prokaria ehfl, Reykjavik, Iceland. 



2. 1 am a co-inventor of the abovc-captioned patent application. 



3. 1 have directed the planning, execution, supervision, and evaluation of the experimental 
protocols and data of the work done by my student Michael Johansson for his M.Sc. thesis at 
the University of Lund, Sweden. These experiments and the obtained results arc described in 
a peer-reviewed publication enclosed as Exhibit 1, of which I am a co-author (Halldorsdottir 
et aL (1998)). I am also co-author of a second peer-reviewed publication enclosed as Exhibit 
2 (Wicher et al. (2001)) that describes much of the same experimental work as in the above 
patent application. 

Experiments demonstrating that the native form of the thermostable cellulase Cell2A has a 
cytotoxic effect on the £. coti host are described in Exhibit 2, Briefly, full-length cellulase 
Cell2A from Rhodothermus marinus was expressed in E. coli strain BL 21(DE3), which was 
cultured under normal conditions (see "Materials and methods" Exhibit 2, p. 579). When 
expression was induced, severe cell lysis is observed in the cells expressing the full-length 
cellulase (see Figure 2 of Exhibit 2). Cellulase production was very low (less than 1% of 
total protein), as shown in Figure 3 of Exhibit 2, which is an SDS-PAGE gel showing the 
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crude cell content of induced E, coli expressing full-length cellulase (lane 2) as compared to 
truncated (lanes 3 and 4) cellulase which show high expression (about 40% of total protein). 
In addition to the low yields produced, the full-length cellulase was not stable at 4°C and lost 
activity when stored for several weeks. See exhibit 2, p. 581, first paragraph. 

In Exhibit 1, no experiments were reported that could have clearly shown this cytotoxic 
effect of the full-length cellulase. The truncated form had not been constructed at that time. 
Although Exhibit 1 reports cloning and expression of full-length cellulase, the paper does not 
discuss the expression efficiency, which in fact was quite low. Further, it should be noted that 
the comparative experiments reported in Exhibit 2 use the same cell hosts, vectors, and full- 
length gene construct. Therefore, the experiments reported in Exhibit 2 arc not contradictory 
to the experiments of Exhibit 1, but rather continuing in-depth studies of the same and related 
materials* 

Exhibit 2 further reports comparative activity studies for full-length cellulase and truncated 
variants (see p. 581, 2nd coL-582, 1st col. and Table 2). It is shown that truncated variants 
unexpectantly have 3 to 4-fold higher specific activity compared to the full-length enzyme, 
and one variant in particular (A(SP)Cell2A: cellulase without 17 N-terminal amino acids) 
retains activity for much longer periods of time than the full-length enzyme. 

4. From my direct involvement in the above described experiments, I conclude that the full- 
length cellulase from Rhodothermns has a cytotoxic effect on & colL The full-length 
cellulase was expressed in much lower yields than N-terminal truncated forms expressed in 
the same host under the same conditions. I further conclude that the described truncated 
variants have 3 to 4-fold higher activity than the full-length cellulase, and that one truncated 
variant lacking the first N-terminal amino acids of the full-length enzyme retains its activity 
at high temperatures for longer periods of time than the full-length enzyme. Additionally, the 
full-length cellulase had a reduction in stability and lost activity when stored for several 



5, I hereby declare that all statements made herein of my own knowledge are true and 
that all statements made on information and belief are believed to be true; and further 
that these statements were made with the knowledge that wilful false statements and 
the like so made are punishable by fine or imprisonment, or both, under Section 1001 
of Title 1 8 of the United States Code, and that such wilful false statements may 
jeopardize the validity of the application or any patent issuing thereon. 
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Abstract A gene library from the thermophilic eubac- 
terium Rhodothermus marinus, strain ITI 378, was 
constructed in pUC18 and transformed into Escherichia 
coll Of 5400 transformants, 3 were active on carboxy- 
methylcellulose. Three plasmids conferring cellulase 
activity were purified and were all found to contain the 
same cellulase gene, eel A. The open reading frame for 
the celA gene is 780 base pairs and encodes a protein of 
260 amino acids with a calculated molecular mass of 
28.8 kDa. The amino acid sequence shows homology 
with cellulases in glycosyl hydrolase family 12. The celA 
gene was overexpressed in E. coli when the pET23, T7 
phage RNA polymerase system was used. The enzyme 
showed activity on carboxymethylcellulose and lichen- 
an, but not on birch xylan or laminarin. The expressed 
enzyme had six terminal histidine residues and was 
purified by using a nickel nitrilotriacetate column. The 
enzyme had a pH optimum of 6-7 and its highest 
measured initial activity at 100 °C. The heat stability of 
the enzyme was increased by removal of the histidine 
residues. It then retained 75% of its activity after 8 h at 
90 °C. 
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Introduction 

Rhodothermus marinus is a marine thermophilic eubac- 
terium isolated from alkaline submarine hot springs 
(Alfredsson et al. 1988). It is an obligate aerobe and 
grows optimally at 65 °C, pH 7.0 and 2% NaCl (Al- 
fredsson et al. 1988). Sequence analysis of a 16S ribo- 
somal RNA gene revealed that R. marinus is most 
closely allied to the Flexibacter-Cytophaga-Bacteroides 
group (Andresson and Fridjonsson 1994). It has been 
shown to produce several thermostable glycosyl hydro- 
lases, including amylases (Hreggvidsson et al. 1992), a 
xylanase (Nordberg Karlsson et al. 1997), a P-xylosidase 
(Manelius et al. 1994) and a cellulase (Hreggvidsson et al. 
1996). A gene from R. marinus encoding a thermostable 
|3-glucanase belonging to glycosyl hydrolase family 16 
has been sequenced and the enzyme characterized 
(Spilliaert et al. 1994). 

A classification of glycosyl hydrolases, in families 
based on amino acid sequence similarities, was proposed 
by Henrissat (1991) and, in 1993, 480 glycosyl hydro- 
lases with 52 different EC numbers were classified in 45 
families (Henrissat and Bairoch 1993). 

There are many potential applications for cellulases 
in industry and biotechnology (Bayer et al. 1994; Beguin 
and Aubert 1994). Thermostable enzymes are attractive 
candidates for some of these applications (Herbert 1992; 
Kristjansson 1989). In this paper we describe the clon- 
ing, sequencing and overexpression of a R. marinus gene 
encoding a thermostable cellulase as well as comparison 
of the amino acid sequence of the R. marinus cellulase to 
that of its hyper-thermophilic and mesophilic counter- 
parts. 



Materials and methods 

Bacteria, plasmids and culture conditions 

R. marinus strain ITI 378 was obtained from the strain collection of 
the Technological Institute of Iceland. It was grown at 65 °C with 
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shaking in medium 162 (Degryse et al. 1978) with 2% NaCl added 
(Alfredsson et al. 1988). Escherichia coliTG\ sup E hsd A5 thi A(lac- 
proAE), FtraD36 proAB* lacf* lacZAM\5 was used as a host for 
the cloning vectors pUC18 and pUC19 and for propagation of 
Ml 3 phages mpl8 and mpl9. £. coli GE1731 (Thorbjarnardottir 
et al. 1995) was used as the host strain for recombinant plasmid 
pET23 (Studier et al. 1990). E. coli was grown in Luria-Bertani 
(LB) broth containing 10 g tryptone, 5 g yeast extract and 5 g 
NaCl/l. Ampicillin was added to the LB/agar medium to a final 
concentration of 100 ug/ml for the selection of recombinants. X-gal 
(5-bromo-4-chloro-3-indoIyI p-D-galactopyranoside) and IPTG 
(isopropyl p-D-thiogalactopyranoside) were added at a final 
concentration of 25 ug/ml and 0.1 mM respectively. IPTG was 
added to LB medium to a final concentration of 0.4 mM for the 
induction of expression for 4 h of the celA gene in the pET23 
expression vector system. For in situ enzyme assays (Wood et al. 
1988), the media were supplemented with 0.5% polysaccharide 
substrate. 



Polysaccharide substrates 

The following substrates were used: Avicel (Fluka Chemica), Ko- 
njak mannan (glucomannan, from Megazyme International Ltd.), 
microgranular cellulose, lichenan, laminarin, birchwood xylan (all 
from Sigma) and carboxymethylcellulose (CMC). The CMC types 
used were CMC (Sigma), Blanose (Hercules) 7M31F (degree of 
substitution, DS, 0.7), here designated CMC1, and Blanose 
12M31P (DS 1.3), here designated CMC2. CMC1 was, unless 
otherwise stated, used for enzyme assays and activity tests in gels. 
CMC (Sigma) was used for plate assays. 



Construction of a genomic library and isolation 
of the cellulase gene 

R. marinus chromosomal DNA was prepared and a gene library 
constructed in the pUCl 8 vector as described before (Spilliaert et al. 
1994). E. coli TGI cells were made competent by the method of 
Chung et al. (1989), transformed with the library and plated on LB/ 
agar plates containing X-gal and IPTG. White colonies, with re- 
combinant pUC18 plasmids, were transferred onto LB plates 
containing 0.5% CMC. Plates were incubated overnight at 37 °C 
and then for 5 h at 65 °C, and stained with 0.1% Congo red 
(Sigma) for 15min according to Wood et al. (1988). A zone of 
hydrolysis on a red background was visible around colonies 
showing cellulase activity. 



Plasmid DNA analysis 

Plasmid DNA was prepared according to standard methods and the 
inserted DNA digested with restriction endonucleases in single 
or double digests for the construction of a restriction map. For 
Southern-blot analysis, restriction fragments were transferred onto 
Hybond N hybridization membranes (Amersham) and hybridized to 
a DNA fragment, previously gel-purified and labelled with dig- 
oxigenin, using a DIG labelling kit (Boehringer Mannheim). 
Restriction fragments were cloned into M13mpl8 and M13mpl9 
and both strands of the DNA were sequenced by the dideoxy-DNA 
chain-termination method and using a Sequenase 2.0 sequencing 
kit (United States Biochemical Corp.) or a Thermo Sequenase cycle 
sequencing kit (United States Biochemical Corp.). A homology 
search and sequence analysis were performed by using the available 
nucleotide and/or protein sequence databases (GenBank, pit, swiss- 
prot) using the standard search algorithms. Compositional analysis 
of protein sequences was done by using the SAPS computer pro- 
gram (Brendel et al. 1992). The computer program ClustalW1.6 
(Thompson et al. 1994) was used to generate pairwise sequence 
alignments. 



Overexpression of the celA gene in pET23 and purification of 
the cellulase 

The celA gene was cloned via the polymerase chain reaction (PCR) 
between the Noel and HindlU sites of pET23b + in order to 
overexpress the gene. The primers used were forward 
5'GGAATTCCATATGAACGTCATGCGTGCGG, and reverse 
3'CCCAAGC TTCTGCACCGTTACGGAAA. The resulting re- 
combinant plasmid was named pET23bAH. The GE1731 deriva- 
tive containing the recombinant plasmid pET23bAH (GE2205) was 
grown overnight at 32 °C. The overnight culture was diluted 1:50 
and grown at 32 °C until A^oo reached 0.8. The culture was induced 
with 0.4 mM IPTG and aliquots of 100 ul were taken for sodium 
dodecyl sulphate/polyacrylamide gel electrophoresis (SDS/PAGE) 
analysis after 0, 1, 2 and 4 h incubation. 

After 4 h of induction, the cells were harvested for protein 
purification. The cellulase was purified using a nickel nitrilotriac- 
ctate (Ni-NTA) column from QIAGEN according to the manu- 
facturer's protocol. Purification was monitored by visualizing 
samples on SDS/PAGE and by in situ enzyme assay. Enzyme ac- 
tivity was measured with the standard assay and the protein con- 
centration was measured with the Bio-Rad (Bradford) protein 
assay (Zaman and Verwilghen 1979). 



Enzyme characterization 

SDS/PAGE was performed in a 12% gel by the method of Laemmli 
(1970) and proteins were stained with Coomassie brilliant blue R- 
250 (Sigma). Enzyme activity was detected in situ in SDS/PAGE 
containing 0.13% CMC. The gel was incubated for 4 h at 70 °C 
and stained with 0.1% Congo red. For detecting activity in SDS/ 
PAGE gels not containing substrate, the gels were covered with a 
1% agarose layer containing 0.15% CMC. 

The standard assay for cellulase activity was done by incubating 
the enzyme (15 mg/ml) at 70 °C for 15 min, with 0.5% (w/v) CMC 
as a substrate in 0.1 M phosphate buffer (Na 2 H 2 P0 4 ), pH 7.0. The 
reducing sugars released were detected by the dinitrosalicylic acid 
method (Sumner and Somers 1949) with glucose as a standard. One 
unit of enzyme activity is the amount that leads to the release of 
I umol reducing sugars/min. All activity measurements were done 
in triplicate. 

The temperature optimum was determined by running the 
standard assay at 60, 70, 75, 80, 90, 95 and 100 °C. Thermostability 
was measured by incubating the enzyme (15 mg/ml in phosphate 
buffer, pH 7.0) at 90 °C for 8 h. Samples were taken at intervals 
and residual activity was measured by the standard assay: All as- 
says were done in triplicate. 

The pH optimum was measured by running the standard assay 
at different pH values. The buffers used were citrate/phosphate 
buffer (0.2 M) for pH 4-5 and sodium phosphate buffer (0.2 M) for 
pH 6-10. For pH 9 and 10, which are outside the buffering range, 
the pH was monitored before and after the reaction. Significant 
changes in pH due to the reaction were not observed. 



Results 

Cloning and sequencing of the R. marinus celA gene 

Three clones among 5400 transformants were found to 
produce a clear halo on 0.5% CMC. Plasmid DNA was 
isolated from these clones, pLHQ, pLHC 2 and pLHC 3 , 
and the size of the inserts determined. The clones had 
inserts of 3 kb, 5 kb and 1.5 kb respectively. Plasmid 
PLHC3 w as selected for restriction endonuclease map- 
ping and sequencing. An insert of 1560 bp was found to 
contain an open reading frame of 780 bp. Two potential 
methionine initation codons were found. Translation 
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1 S . lividans 

2 S . rochei 

3 A.aculeatu 

4 A. kawachii 

5 H . jecorina 

6 H . insolens 

7 E. carotovo 

8 CelB T.mar 

9 CelB T. nea 

10 CelA T.mar 

11 CelA T.nea 

12 CelA R.mar 



1 15 
MRTLRPQARAPRGLL 
— -MPRLRHHPRTLR 



-MLKSALL 
-MQTVNTQ 



16 30 
AALGAVLAAFALVSS 
AVSAALLTALAALAA 

MKAFHLLA 

MKLSMTLS 

MKFLQVLP 

LGPAAVSVQSASIPT 
PHRIFRVLLPAVFSS 

MRWAVLLMWFS 

MPXWSFLLWS 
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MNVMRAVL 

91 105 

GSAPTNGAPK 

GSVPTNGAPK 

TSWSTKWNWS 

ASWHTKWTWS 

ASWHADWQWS 

IQWSTAWEWQ 

WNWHWPSSTHS V 

ADLYNIVLQNPDSWV 
ADIYNIVLQNPDSWV 
ADIQNLSPKEPERYV 
GDVEDLSAREPERYI 

ADHDNGKNVA 

181 195 
MIWFNRVGPIQPIG- 
MIWFKRVGQIQPIG- 
MIWLARYGGVQPIG- 
MIWLARYGSVQPIG- 
MIWLGKYGDIGPIG- 
MIWLARYGGIYPIG- 
MIWLNDTN-AGPAG- 
MVWFYNNV-LMPGGQ 
MVWFYNK1-LMPGGQ 
MVWFYFNN-LTPGGE 
MVWFYFNE-LTPGGK 
MIWLNWNGGVMPGG- 
271 285 
ENDWYLTSVQAGFEP 
GNDWYLTSIQAGFEP 
ASSQYLITLQFGTEP 
ASSQHLITLQCGTEP 
AAGQYVLSYQFGTEP 
AREQNLI V YQVGTEC 
SDEKYISSVEFGTEI 
FEELYFCVWEIGTEF 
FDEMYFCVWEIGTEF 
FEDLYFTVWEIGTEF 
FDDLYFTVWEIGTEF 
RPEWYLHAVETGFEL 
361 375 
LPSGQRITNAWNASL 
LPSGQSWHAWNASV 



VLSLLLLFGCDWLFP 
106 120 

-SYPSVFNGC-HYTN 
-SYPSVFNGC-HYTN 

-GGENSVK SYAN 

-GGEGTVK SYSN 

-GGQNNVK — SYQN 

-GAPDNVK SYPY 

KAYPSLVSGW-HWTA 
HGYPEIYYGYKPWAG 
HGYPEIYYGYKPWAA 
LGYPEFYYGYKPWEN 
LGYPEFYYGYKPWER 
-AYPAIYFG-CHWAP 
196 210 

SPVGTASVGGR- 

SQVGTASVAGR- 

SQIATATVDGQ- 

KQ IATATVGGK- 

SSQGTVNVGGQ- 

TFHSQVNLAGR- 

DYIETVFLGDS- 

KVDEFTTTVEINGVK 
KVDEFTTTIEINGSP 
KIEEFTIPFVLNGES 
KVGE YTVS FELNGEH 

S RVATVELAGA- 

286 300 
WQN-GAGLAVNSFSS 
WQN-GAGLAVNSFSS 
FTGGPATLSVSNWSA 
FTGG PAT FTVDNWT A 
FTG-SGTLNVASWTA 
FTGGPARFTCROFRA 
FGG-DGQIDITEWRV 
GDP-NTTAAKFGWTF 
GDP-NTTAAKFGWTF 
GSP-ETKSAQFGWKF 
GSP-ETKSARFGWTF 
WEG - GAGLRTAD FSV 
376 390 
TPSSGSVTATGASHN 
S PS SGAVTATGPAES 



46 60 
TTICEPFGTTTIQGR 
TTICEEFGSTVIQGR 
AQLCDQYATYTGGV- 
QTMCSQYDSASSPP- 
QTSCDOWATFTGNG- 
RSLCELYGYWSGNG- 
SSSNDADKLYFGNNK 
TSVGATDISFNGFP- 
TDIGATDITFKGFP- 
TKPGTSDFVWNGIP- 
TAPGTADFRWNDMP- 
VELCGRHDARDVAGG 
136 150 
SAAPSSISYGF-VDG 
SSAPSSISYGF-VDN 
SQ I PTTARWS Y- DNT 
SSIPTSVTWSQ-DOT 
SSMPTTASWSY-SGS 
NSMRTSVSWTY-DRT 
KSITSNVTYSI-KAT 
PDFYVTLDYSIWYEN 
PDFYVTLDYSIWYEN 
KSFSVEVSFDIHHEP 
ESFTVELSFEIDHMP 
GAVRRAHELDVTPIT 
226 240 
-ANGSNDVLSFVAPS 
-GNGTNDVLSFVAPS 
-ANGSQKTYSFVAPT 
QAGAEQKTYS FVAGS 
- YNG AMQVYS FVAQT 
-YNGHMRVYSFLPPS 
DNGGGWNVFSFVHTS 
-WGWDYLAFRLTT PM 
-WGWDYLAFRLTTPM 
-WGWDYLAFRLKDPV 
-WNWDYLAFRLKNPV 
-WDWNYIAYRRTTPT 
316 330 
PGDPGGPSACAVSYG 
PGDPGTPAACTVSYA 



31 45 

LVTAAAP AQAD 

LLTATAP AQAD 

ALAGAA VAQQ 

LFA — A TAMG 

ALIP AALA 

IPANLE PRQI 

LLLSSLT VSAA 

ALLFSS EWL 

AFLFSA EWL 

MVLM 

MVEL 

DGDMGKEPEPEPEPT 
121 135 
-CSPGTDLPVRLDTV 
-CSPGTALPARISGI 
-SGLTFN-KKLVSQI 
-SGLTFD-KKLVSDV 
-SQBUPQKRTVNSI 
-VGKQIQRGRKISDI 
GYTENSGLPIQLSSN 
HNSGVEFLPVKVKDL 
HNSGTEILPVKVKDL 
HTAEGSKLPVPVSSM 
HAAEGTKLPLWSSV 

ARAIRDCAARA 

211 225 

---TWEVWSA 

TWEVWSG 

TWELWYG 

SWEVWYGTST-- 

SWTLYYG 

TWDLWTG 

SWNVFKGWIN-A 

QETKWDVYFAP 

VETKWDVYFAP 

VEGTHELHLAE 

KRGIWELWHAE 

TWEVWYAD 

301 315 

TVETG TPGGTD- 

TVNTGGSQNPGDPNG 

SV Q 

SVN 

SIN 

DLW 

DVK 

RDFSVEWK 

KDFSVEIGE 

ENFSIDLEVRE 

NNFS I DMEVKG 

TVQ 

391 405 406 420 421 

ARIAPGGSLSFGFQG TYGGAFAEPTGFRLN GTACTTV 
ARIAAGGSQSFGFQG AYSGSFAQPAAFQLN GTACSTV 



61 75 
- YWQNNRWGS T A PQ 
-YWQHNRWGTSATQ 
-YTINNNLWGKDAGS 
-YSVNQNLWGEYQGT 
-YTVSNNLWGASAGS 
-YELLNNLWGKDTAT 
-YYLFNNVWGKDEIK 
-VTMEINFWNVKSYE 
-VTMEINFWNVKSYE 
-LSMENLWNIKEYS 
-LSMEWLWNVERYT 
RYRVINNVWGAETAQ 
151 165 
AVY-HASYDIWLDPT 
AVY-NASYDIWLDPT 
GIRADVAYDLFTAAD 
NVQADVSYDLFTAAN 
N IRANVAYDL FTAAN 
DIRANVAYDVFTARD 
GTY-NAAYDIWFHTT 
NLPINLAMETWITRS 
DLPINLAMETWITRK- 
SLPLNFAMETWLTRE 
SLPLNFAMETWLTRE 
TGRWNAAYDIWFSPV 
241 255 
-AISGWSFDVMDFVR 
-AMSSWSFDVMDFVR 
-PITSFQGDVNDFFK 
- PINSWSGDIKDFFN 
-NTTNYSGDVKNFFN 
GDIRDFSCDIKDFFN 
-GTNSASLNIRHFTD 
-KEGKVKINVKDFVQ 
-KDGRVKFNVKDFVE 
-KKGRVKFDVRHFLD 
- RKGRVRFNVKDFLD 
— TSVSELDLRAFID 
331 345 
TNVWQDGFTADVTVT 
TNVWPGGFTANVTVT 



76 90 
CVTATDTGFRVTQAD 
CVTATDSGFRVTQAD 
-GSQCTTVNSASSAG 
-GSQCVYVDKLSSSG 
-GFGCVTAVSLSG-G 
SGWQCTYLDGTNNGG 
GWQQT IFYNSPISMG 
GETWLKFDGEKVEFY 
GETWLKFDGEKVQFY 
GSVAMKFDGEKITFD 
GTWMRFDGERLTFN 
C 1EVGLETGNFT I TR 
166 180 
A-RTDGV-NQ- -TEI 
P-RTDGV-NR- -TEI 
INHVTWS -GD — YEL 
ADHATSS-GD--YEL 
PNHVTYS-GD--YEL 
PDHPNWG-GD--YEL 
D-KANWD-SSPTDEL 
PDQTSVSSGD--AE1 
PDQTSVSSGD--VE1 
KYQTEASIGD--VEI 
K YQVEAS IGD — VE I 
TNSGNGYSGG— AEL 
256 210 

ATV--ARGLA 

ATV--ARGLA 

Y LTQ — NHGFP 

Y LTQ — NQGFP 

Y LRD — NKGYN 

Y LER--NHGYP 

Y----LVQ--TKQWM 
KAAEWKKHSTRIDN 
KAAEVIKKHSTRVEN 
AAGKALSS-SARVKD 
VAGEYLSR-STRVKD 

DAV — ARGYI 

346 360 
NTGTAPVDGWQLAFT 
NNGSAPVDGWRLAFT 
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Fig. 1 Multiple alignment of the deduced amino acid sequences of the 
celA gene from Rhodotltermus marinus and the 11 sequences that 
showed homology to it. Amino acid residues idenucal to those of the 
CelA protein are shown in bold. The putative catalytic residues 
(Glu 15 * and Glu 242 ) are underlined. Accession numbers for the 
deduced sequences are as follows: (/) Streptomyces lividans, celB, 
U04629; (2) Streptomyces rochei, eglS, X73953; (5) Aspergillus 
aculeatus, FI-CMCase mRNA, X52525; (4) Aspergillus kawachii, 
mRNA for an endoglucanse, D 12901; (5) Hypocrea jecorina, 
endoglucanase gene, AB003694; (d) Hwnicola insolens, mRNA 
fragment, A22907; (7) Envinia carotovora, celS, M32399, (8, 9) 
Thermotoga maritima, eel A and celB, Z69341; (10, II) Thermotoga 
neapolitana, celA and celB, Z86103, U93354; (12) Rhodothermus 
marinus, celA U72637 



initation at the first codon would result in a 780-bp 
message corresponding to a polypeptide of 260 amino 
acids with a calculated molecular mass of 28.8 kDa, 
while translation initation of the second codon would 
result in a 771 -bp message corresponding to a polypep- 
tide of 257 amino acids. By using the computer program 
PSORT, the polypeptide of 260 amino acids was found 
to contain a putative signal peptide of 17 amino acids 
(Nakai and Kanehisa 1991). Assuming that the prepro- 
tein is 28.8 kDa, the predicted mature protein would be 
243 amino acids with a calculated molecular mass of 
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26.9 kDa. No sequences resembling known eubacterial 
promoter sequences were identified upstream from the 
celA gene but promoter sequences in R. marinus have 
not been characterized. The cellulase gene has a G + C 
content of 65.4%, which corresponds to the 64.4% value 
found for R. marinus (Alfredsson et al. 1988). The cel- 
lulase gene was named eel A. The nucleotide sequence 
data will appear in the GenBank/EMBL/DDBJ data- 
bases under the accession no. U72637. Partial nucleotide 
sequencing confirmed that plasmids pLHQ and pLHC 2 
also contained the celA gene. 



Homology with cellulases from other microorganisms 

A search for homology between the CelA protein and 
cellulases from other microorganisms was done by using 
the available nucleotide and protein databases with the 
BLAST algorithm (Altschul et al. 1990). The CelA 
protein showed homology to 11 cellulases from 9 spe- 
cies. Eight of these enzymes have been assigned to cel- 
lulase family H, which corresponds to glycosyl hydrolase 
family 12 (Liebl et al. 1996; Tomme et al. 1995). It is 
concluded that CelA of R, marinus belongs to this 
family. The homology between CelA and the other en- 
zymes is low, identity ranging from 17% to 27%. Mul- 
tiple alignment of the deduced amino acid sequence of 
CelA of R. marinus and the other cellulases is shown in 
Fig. 1. The accession number of each sequence is given 
in the figure legend. The amino acid composition of the 
12 cellulases is given in Table 1. 

An open reading frame of about 500 bp was found 
about 1 kb upstream of the celA gene in the pLHC 2 
insert. The derived amino acid sequence showed ho- 



mology to argininosuccinase from both eukaryotes and 
prokaryotes. 

Overexpression of *the eel A gene in E. coli, 
and SDS/PAGE analysis 

In order to overexpress the celA gene in E. coli, a PCR 
copy of the gene with Ndel-Hindlll restriction sites was 
Iigated into the expression vector pET23b( + ) cut with 
the same enzymes. The resulting plasmid, named 
pET23bAH, encodes a cellulase protein with six termi- 
nal His residues. 

Synthesis of the enzyme was monitored by removing 
aliquots for SDS/PAGE analysis during 4 h induction 
with 0.4 mM IPTG: Coomassie-blue staining of SDS/ 
PAGE gels revealed two bands in induced samples. One 
band had an apparent molecular mass near 50 kDa, but 
the other had an apparent molecular mass of 28 kDa, 
which corresponds to the expected calculated value for 
the R. marinus cellulase (data not shown). 

The cellulase was purified by using the Ni-NTA spin 
column. Purification was monitored by removing ali- 
quots for SDS/PAGE analysis and in situ assays, after 
column loading, washing and elution. Coomassie blue 
staining of the purified cellulase protein revealed a 
strong band of apparent molecular mass 28 kDa, which 
is in good agreement with the calculated value of 
28.8 kDa (Fig. 2). 

The purified protein was treated with proteinase K 
(0.01 mg/ml) for 4 h at 37 °C. After this treatment the 
protein did not adsorb to the Ni-NTA column. This 
implies that the histidine tail and possibly linking amino 
acids were removed by the treatment. No reduction in 



Table 1 Amino acid composition of 12 cellulases from glycosyl hydrolase family 12. For the full names of the organisms see Fig. 1. The 
percentages for Glu from thermophilic organisms are shown in bold 

Amino Composition (%) 

acid 

A. acul A. kaw E. caro H. irtso H.jeco S. livid S. roch T. mar A T. mar B T. nea A T. nea B R. mar 



A 


11.0 


7.9 


5.3 


6.3 


9,4 


12.6 


13.1 


4.7 


4.7 


4.3 


4.7 


11.9 


R 


1.3 


0.4 


1.5 


6.7 


1.3 


4.2 


4.2 


2.7 


1.8 


6.2 


1.8 


6.9 


N 


5.9 


4.2 


8.0 


6.3 


9.0 


5.0 


6.0 


3.9 


5.5 


4.7 


4.7 


5.4 


D 


4.2 


5.4 


6.1 


6.7 


3.0 


4.7 


3.7 


5.1 


5.8 


5.4 


6.9 


6.2 


C 


0.8 


1.3 


0.0 


2.0 


0.9 


1.6 


1.6 


0.0 


0.4 


0.0 


0.4 


1.9 


E 


1.7 


2.5 


3.0 


3.1 


0.9 


1.8 


1.6 


10.9 


7.7 


10.9 


7.7 


6.5 


Q 


6.8 


6.3 


3.0 


3.9 


6.0 


3.7 


4.2 


1.2 


1.8 


0.4 


1.5 


0.8 


G 


9.7 


9.2 


7.6 


9.8 


11.5 


11.8 


11.0 


7.0 


6.2 


7.4 


6.2 


10.4 


H 


1.3 


1.3 


2.7 


1.2 


0.9 


0.5 


1.0 


1.6 


1.1 


1.6 


1.1 


1.5 


I 


4.2 


2.9 


6.1 


5.5 


3.8 


2.9 


3.1 


4.3 


4.0 


2.7 


5.8 


4.2 


L 


6.3 


5.4 


6.1 


7.5 


5.6 


5.0 


4.5 


7.8 


6.6 


8.2 


5.8 


6.9 


K 


3.4 


4.2 


4.5 


2.4 


2.6 


0.3 


0.3 


7.0 


6.6 


4.7 


6.6 


0.8 


M 


0.8 


2.1 


1.5 


1.6 


1.7 


0.8 


1,0 


2.7 


2.6 


3.1 


2.6 


1.5 


F 


4.2 


3.8 


4.9 


3.5 


3.8 


4.5 


4.2 


7.8 


7.3 


7.4 


1.1 


3.1 


P 


3.0 


3.3 


3.0 


5.1 


3.0 


6.8 


6.5 


5.4 


4.4 


4.3 ' 


4.7 


4.6 


S 


10.5 


14.2 


12.1 


6.7 


11.5 


8.9 


11.0 


7.8 


5.5 


5.4 


4.7 


3.1 


T 


10.1 


10.5 


8.3 


6.3 


8.1 


11.3 


9.4 


5.1 


6.9 - 


6.2 


7.7 


6.9 


W 


4.2 


4.2 


5.7 


4.7 


3.8 


2.9 


2.9 


4.3 


4.7 


4.3 


4.4 


5.4 


Y 


5.1 


4.6 


4.5 


6.3 


6.8 


2.4 


2.4 


3.9 


5.1 


4.7 


5.1 


3.8 


V 


5.5 


6.3 


6.1 


4.3 


6.4 


8.4 


8.4 


7.0 


11.3 


8.2 


9.9 


8.1 
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kDa M 1 2 3 4 




Fig. 2 Sodium dodecyl sulphate (SDS)/polyacrylamide gel of purifi- 
cation of the CelA protein. M molecular size marker (kDa), / nickel 
nitrilotriacetase column flow-through after sample loading, 2 flow- 
through after column washing, 3 flow-through after column elution 
with 1 M imidazole, 4 flow-through after second elution of column 
with 1 M imidazole. Arrow the purified CelA protein band in 3 

enzyme activity was observed, as measured by the dini- 
trosalicylic acid method. Samples of the heated enzyme 
were boiled for different lengths of time, run on a SDS/ 
PAGE gel containing CMC and stained both for activity 
and for protein (Fig. 3). The unboiled control showed 
three activity bands with an apparent molecular mass of 
35-46 kDa. After boiling, activity disappeared and a 



kDa 1 2 3 4 5 




Fig. 3 SDS/polyacrylamide gel electrophoresis of recombinant cellu- 
lase treated with proteinase K. Lanes: 7 low-molecular-mass markers 
from Pharmacia, 2 unboiled sample, 3 sample boiled for 2 min in 
mercaptoethanol, 4 sample boiled for 5 min, 5 sample boiled for 10 
min 



protein band of about 26 kDa appeared (Fig. 3). By 
running proteinase-treated and untreated samples in 
parallel in a SDS/PAGE Phast gel it was confirmed that 
the treated protein is 1-2 kDa smaller than the un- 
treated one (data not shown). No enzyme activity was 
connected with the protein monomer. 

Samples of the recombinant enzyme and a previously 
described cellulase partially purified from R. marinus 
(Hreggvidsson et al. 1996) were run together on a gel 
containing CMC (Fig. 4). The native activity appeared 
in a position corresponding to the smallest activity band 
of the recombinant protein. In order to see if CMC was 
affecting the mobility of the active enzyme, samples 
identical to those shown in Fig. 4 were run on a SDS/ 
PAGE gel without CMC. Activity was detected in an 
agarose overlay with CMC. The same bands were seen 
and no difference in mobility was observed (data not 
shown). It is noted that the same activity bands were 
seen whether the enzyme had been heated with protease 
or not (Figs. 3, 4). The nature of the apparent aggregates 
that represent the active protein is not known. 

Some properties of the R. marinus cellulase 

The activity of purified cellulase was investigated at 
different temperatures and pH values. 

The temperature for maximal activity was determined 
by incubating the enzyme at different temperatures ac- 
cording to the standard assay. As shown in Fig. 5, the 
initial activity was still increasing between 90 °C and 
100 °C. Thermal stability was investigated by incubating 
the enzyme for up to 8 h at 90 °C (Fig. 6). Both prote- 

kDa 1 2 3 4 




Fig. 4 SDS/polyacrylamide gel electrophoresis of native and recom- 
binant cellulase from R. marinus. The gel contained carboxymethyl- 
cellulose and was stained for activity and protein. Lanes: 7 low- 
molecular-mass markers from -Pharmacia; 2 recombinant cellulase 
treated with proteinase K, unboiled sample; 3 supernatant from 
E. co//'-expressed recombinant clone, unboiled; 4 partially purified 
cellulase from R. marinus, unboiled 
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Fig, 5 Influence of temperature on the activity of CelA from 
R. marinus. Enzyme was measured according to the standard assay 
at 60, 70, 75, 80, 90, 95 and 100 °C. The highest level of activity, 3.44 
U/mg, was obtained at 100 °C and was set at 100% 

ase-treated and untreated enzyme was. tested. The pro- 
tease-treated enzyme showed more heat stability than 
the untreated one. After 8 h incubation the treated and 
untreated enzyme retain 75% and 45% of their activity 
respectively. 

The pH optimum of the purified cellulase was deter- 
mined by incubating the enzyme at different pH values 
according to the standard assay (Fig. 7). The enzyme 
showed a pH optimum of 6-7. 

The enzyme was found to hydrolyse polysaccharides 
with p 1-4 and P 1-3- 1-4 linkages. The specific activity of 
the purified enzyme at 70 °C in CMC, lichenan and 
glucosomannan was 1.0, 1.7 and 2.3 U/mg, respectively. 

125—1 : 1 




oH 1 1 \ 1 

0 2 4 6 8 10 

Time(h) 

Fig. 6 Thermal stability of CelA from R. nwrmus. Approximately 3.5 U 
enzyme was preincubated for 1,2, 4 and 8 h at 90 °C Residual activity 
was measured according to the standard assay. □ Untreated cellulase, 
O cellulase treated with proteinase K with the histidine tail removed 
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PH 

Fig, 7 Influence of pH on the activity of CelA from R. marinus. 
Enzyme activity was measured at pH 4, 5, 6, 7, 8, 9 and 10. The highest 
level of activity, 0.77 U/mg, was obtained at pH 6 and was set at 
100% 

The extent of hydrolysis measured after 48 h at 70 °C 
was 13% for CMC, 51% for lichenan and 34% for 
glucosomannan. For the highly substituted CMC2 the 
extent of hydrolysis was only 4% (specific activity 0.12 
U/mg). No activity was found with Avicel, microgran- 
ular cellulose, birch xylan or laminarin (plate assays). 



Discussion 

In this paper we describe the cloning, sequencing and 
overexpression of a gene, eel A, from the thermophilic 
eubacterium R. marinus. The gene encodes a thermos- 
table cellulase. It has a coding sequence corresponding 
to a polypeptide of 28.8 kDa, including a putative signal 
peptide of 17 amino acids. The CelA enzyme is able to 
hydrolyse pi-4 linkages in carboxymethyl cellulose and 
mixed P-glucans (lichenan) but does not cleave pi-3 
linkages, as shown by its inability to hydrolyse lami- 
narin, a homopolymer of pi-3-linked glucose residues. 
Activity towards birch xylan or crystalline cellulose was 
not detected. On the basis of these observations the CelA 
enzyme can be classified as an endo-l,4-P-glucanase (EC 
3.2.1.4). 

A thermostable cellulase from R. marinus was re- 
cently described. In our study the apparent molecular 
mass of this enzyme on SDS/PAGE gels was found to 
correspond to one of three bands of activity found for 
the recombinant enzyme. Comparing the characteristics 
of the recombinant and native cellulases (Hreggvidsson 
et al. 1996, and the present study) we find no significant 
differences. The highest measured activity on CMC was 
at 100 °C for both enzymes, the pH optimum was 7 for 
the native and 6-7 for the recombinant enzyme and the 
specific activity at 70 °C was 1.4 U/mg and 1.0 U/mg for 
the native and recombinant enzyme respectively. No 
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differences have been found in substrate specificity. It is 
important to note that only one band of cellulase ac- 
tivity was found in the crude, supernatant on SDS gels 
(Hreggvidsson et al.i996). We therefore think that most 
likely the native enzyme is encoded by the celA gene. 
Enzyme activity in SDS/PAGE gels was associated with 
apparent aggregates of the recombinant protein. Noth- 
ing can be concluded about the nature of these aggre- 
gates. 

A thermostable R. marinus glucanase, estimated to be 
29.7 kDa, has also been characterized (Spilliaert et al. 
1994). This enzyme had no activity on CMC cellulose 
but high activity on lichenan and laminarin. It therefore 
differs considerably in specificity from the recombinant 
and native enzymes of this study. It belongs to glycosyl 
hydrolase family 16. 

On the basis of sequence comparisons, the CelA en- 
zyme was assigned to glycosyl hydrolase family 12 
(Fig. 1). The overall homology between CelA from 
R. marinus and the 1 1 cellulases shown in Fig. 1 is low. 
There is, however, a box of three conserved amino acids 
to be found among the 1 1 cellulases, starting at Glu 159 in 
the R. marinus enzyme (E-M-W) as well as eight other 
conserved amino acids: Asn 60 ' Trp 62 , Pro 172 , Gly 174 , 
Trp 187 , Phe 218 , Gly 240 and Glu 242 . In spite of low overall 
identity between the protein sequences, it is interesting 
to compare the thermostable candidates (R. marinus 
CelA and T. maritimd and T. neapolitana CelA and 
CelB) to their mesophilic counterparts. By comparing 
small proteins that comprise only a single domain, the 
search for determinants of protein stability is somewhat 
simplified. Liebl et al. (1996) noted an increased per- 
centage of the charged residue Glu and a slight increase 
in Pro in CelA and CelB from T. maritima compared to 
the less thermostable cellulases of family 12. By ana- 
lysing differences in amino acid composition between the 
thermophilic and mesophilic enzymes in family 12 given 
in Table 1, it is evident that the percentage of Glu is 
higher in the thermophilic candidates, but an increase in 
Pro content is not observed. The increased percentage of 
Glu among the thermostable enzymes is not surprising 
since charged residues play a role in structure stabiliza- 
tion by salt bridge formation and by forming hydrogen 
bonds with neutral residues. Correlation between ther- 
mostability and the number of salt bridges and hydrogen 
bonds in proteins has indeed been observed (Macedo- 
Ribeiro et al. 1996; Szilagyi and Zavodszky 1995; Tan- 
ner et al. 1996); 

Glycosyl hydrolases hydrolyse glycosidic bonds via 
general acid catalysis that requires two amino acid res- 
idues: a proton donor and a nucleophile/base. In most 
glycosyl hydrolases studied, only Asp and/or Glu resi- 
dues have been found to perform catalysis (Davies and 
Henrissat 1995). Indeed, two conserved Glu residues are 
found in eel A from R. marinus, Glu 159 and Glu 242 . These 
amino acid residues correspond to Glu 158 and Glu 246 in 
£. carotovora CelS, and Glu 134 and Glu 218 in A. acu- 
leaius FI-CMCase. Torronen et al. (1993) have com- 
pared the E. carotovora and A. aculeatus enzymes to 17 



enzymes from glycosyl hydrolase family 11, where the 
catalytic role of Glu residues has been estabilished (FCo 
et al. 1992). They predicted that the two conserved Glu 
residues in the family 12 enzymes are essential for cat- 
alytic activity. 

Although a variety of factors have been implicated 
in the origins of thermostability, no general conclusion 
has emerged (Danson et al. 1996). It has been suggested 
that both thermophilic and mesophilic proteins attain 
a similar degree of flexibility at their respective optimum 
temperatures (Jaenicke 1996). Furthermore, compari- 
sons of the three-dimensional structures of thermostable 
proteins to those of their corresponding mesophilic 
counterparts show that high thermostability can 
be achieved without large changes of overall protein 
structure (Macedo-Ribeiro et al. 1996; Delboni et al. 
1995) It is likely that the same would apply for the 
thermostable cellulases of glycosyl hydrolase family 
12. However, it is clear that this question can only 
be resolved by determining the three-dimensional 
structure of thermophilic and mesophilic enzymes of this 
family. 
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Abstract The thermostable cellulase Cell2A. from 
Rhodothermus marinus was produced at extremely low 
levels when expressed in Escherichia coli and was cyto- 
toxic to the cells. In addition, severe aggregation oc- 
curred when moderately high concentrations of the en- 
zyme were heat-treated at 65 °C, the growth optimum of 
R, marinus. Sequence analysis revealed that the catalytic 
module of this enzyme is preceded by a typical linker se- 
quence and a highly hydrophobic putative signal peptide. 
Two deletion mutants lacking this hydrophobic region 
were cloned and successfully expressed in E. coli. These 
results indicated that the N-terminal putative signal pep- 
tide was responsible for the toxicity of the full-length en- 
zyme in the host organism. This was further corroborat- 
ed by cloning and expressing the hydrophobic N-termi- 
nal domain in E. coli, which resulted in extensive cell ly- 
sis. The deletion mutants, made up of either the catalytic 
module of Cell. 2 A or the catalytic module and the puta- 
tive linker sequence, were characterised and their proper- 
ties compared to those of the full-length enzyme. The 
specific activity of the mutants was approximately three- 
fold higher than that of the full-length enzyme. Both mu- 
tant proteins were highly thermostable, with half-lives 
exceeding 2 h at 90 °C and unfolding temperatures up to 
103 °C 
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Introduction 

An extremely thermostable endoglucanase, Cell2A, 
originating from Rhodothermus marinus was produced, 
in Escherichia coli and subsequently characterised 
(Halldorsdottir et al. 1998). Cell2A, which belongs to 
the family 12 glycosyl hydrolases, is among the most 
thermostable endoglucanases identified to date, with an 
activity optimum substantially higher than the growth 
maximum of R. marinus. Hyperthermophilic bacteria be- 
longing to Thermotoga species (Bok et al. 1998; Liebl et 
al. 1996) and the hyperthermophilic archaeon Pyococ- 
cus furiosus (Bauer et al. 1999) represent the only organ- 
isms that are capable of producing family 12 endoglu- 
canases with comparable thermostabilities to the R. ma- 
rinus enzyme. 

The glycosyl-hydrolase family 12 currently contains 
21 retaining enzymes (Coutinho and Henrissat 1999). 
Members of this family catalyse the hydrolysis of (3-1,4 
glycosidic bonds primarily in mixed-linkage (1— >3), 
(l-»4) p-D-glucans, and less efficiently in other cellu- 
lose substrates like carboxymethyl cellulose (CMC) or 
Avicel (Bauer et al. 1999; Liebl et al. 1996). The three- 
dimensional structure of the catalytic module of a repre- 
sentative of this family, Cell2B from the mesophilic 
bacterium Streptomyces lividans, has been determined. 
The enzyme displays (3-jellyroll fold very similar to fam- 
ily 11 xylanases (Sulzenbacher et al. 1997). 

In this study we demonstrate the toxicity of full- 
length Cell2A, manifested in extensive cell-lysis and 
very low production yields when expressed in E. coli 
strain BL21(DE3). Two deletion mutants of the cell2A 
gene lacking the sequence encoding the N-terminal hy- 
drophobic amino acid residues were cloned and ex- 
pressed in E. coli with high production yields.. The mu- 
tants were subsequently characterised with respect to 
their biochemical properties and thermostability. 
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Materials and methods 

DNA and protein sequence analysis 

The BLAST 2.0 sequence similarity search tool (on the server of 
the National Center for Biotechnology Information: http://www. 
ncbi.nlrn, nih.gov/) was used to investigate related sequences 
(Madden et al. 1996). Properties of Cell2A, based on the primary 
structure analysis, were predicted using the ProtScale (hrtp://www. 
expasy.ch/cgi-bin/protscale.pl) and PROSITE (hctp.7/www.expasy. 
ch/tools/scnpsite.htrnl) tools on the ExPASy server.. Secondary 
structure prediction was performed using the PHDsec algorithm 
(Rost and Sander 1993) on the Predict protein server (http://cubic. 
bioc.columbia.edu/predictprotein) 



Bacterial strains 

The Nova Blue and BL21 (DE3) (Novagen, Madison, Wis.) strains 
of E. coli were used for cloning and expression of the ceU2A mu- 
tants. The E. coli strain GE2205 (Technological Institute of Ice- 
land. Iceland), harbouring pET23bAH plasmid, was used for ex- 
pression of the cM2A gene (Hall dors do ttir et al. 1998). 

Construction of plasmids and sequencing 

Plasmids pET25b(+), with a T7/lac promoter, and pET23b(+), 
with a T7 promoter (Novagen), were used for expression of the 
truncated forms of the Cell2A protein. In addition to the differ- 
ence in promoters, pET25b(+) allows C-terminal fusion with a 
herpes simplex virus (HSV) immunodetection tag followed by a 
six-histidine purification tag, while the pET23b(+) allows C-ter- 
rninal fusion with a six-histidine purification tag (His) alone. 
Truncated forms of the cellulase described in this study were pro- 
duced either fused to the His-tag in pET23b(+) or to the HSV-His 
tag in pET25b(+) (Fig. 1). The gene fragments encoding: (1) 
Cell2A without the putative signal peptide (first 17 amino acids, 
named A(SP)cell2A\ (2) 223 amino acids from Cell2A (starting at 
amino acid 38), named A(SP t L)ceU2A\ and(3) the N-terminal 37 
amino acid residues of Cell2A, named A(CM)celI2A, were ob- 
tained by PGR utilising the appropriate primers (Table 1) and in- 
troducing restriction sites for cloning. A hot- start PCR protocol of 
35 cycles of denaturation at 95 °C for 1 min, annealing at 60 °C 
for 1 min, and primer extension at 72 °C for 1 min was employed 
using an AmpliTaq Gold polymerase (Perkin-Elmer AB, Foster 
City, Calif.) and the pET23bAH plasmid as template. The PCR 
fragments were separated on agarose gel and purified using 
the QLAJEX II kit (QIAGEN, Hilden, Germany). The ampiicons, 
designated A(SP)cell2A and A(SP,L)celI2A, were digested with 
Ndel-Hindlll restriction enzymes and cloned into pET25b(+) us- 
ing the same 'restriction sites in the plasmid polylinker, yielding 
the vectors pET25A(SP) and pET25A(SPL) respectively. Analo- 
gously, the same inserts were cloned in the pET23b(+) vector us- 
ing the same restriction sites, yielding the plamids pET23A(SP) 
and pET23A(SP.L) for A(SP)ceU2A and A(SP t L)cell2A respective- 
ly. The amplicon A(CM)cell2A was digested wiih Ndel-Xhol, pu- 
rified with the same kit as above, and cloned into pET25b(+) us- 
ing the same restriction sites, yielding the plasmid pET25A(CM). 




Fig. 1 Proteins studied in this work: A Cell2A. B A(SPJ-)Cell2A, 
C A(SP)Cell2A, D A(SP,L)Cell2AH, E A(SP)Cell2AH, F 
A(CM)Ce.U2AH. The proteins shown in A-C were expressed from 
the pET23b(+) ( vector, all others from the pET25b(+) vector. 
SP signal pepticle, L linker, CM catalytic module, Hsv HSV tag, 
H 6 hexahistidine tag 



A standard electroporation procedure was employed for transfor- 
mation of vectors into Nova Blue electrocompetent cells. A Gene 
Pulser II (Bio-Rad, Richmond, Calif.) connected to a Pulse Con- 
troller II (Bio-Rad) was used. Plasmids were prepared using the 
QIAGEN Mini Plasmid Kit The presence of cloned inserts was 
checked by restriction analysis, and plasmids were retransformed 
into the BL21(DE3) production strain using the same procedure. 
The presence of cloned inserts was verified by restriction analysis 
and nucleotide sequencing with the dideoxy chain termination 
method using the ABI PRISM Dyes Terminator Cycle Sequencing 
Ready Reaction (Perkin -Elmer). All restriction enzymes and T4 
ligase were from Life Technologies (Rockville, Md.), The en- 
zymes were used according to the manufacturer's protocols. 



Culture conditions and protein production 

For expression of truncated forms of the cellulase, E. coli strain 
BL21(DE3) harbouring the desired plasmids was grown on LB 
medium containing ampicillin (0.1 mg/ral) to OD 600 =0,7 and then 
induced with IPTG to a final concentration of 1 mM. Growth was 
continued for an additional 2.5-3 h and the cells were harvested 
by centrifugation. The pellet was resuspended in 20 mM Tris-HCl, 
pH 7.4, 20 rnM imidazole, 0.5 M NaCl buffer. The cells were dis- 
rupted with a UP400S so licator (Dr. Hielscher GmbH, Stahnsdorf , 
Germany) at 60 W cm.- 2 , 3x120 s on ice. The crude extract was 
centrifuged at 27,000x£ for 20 min at 4 °C and the supernatant 
stored at -20 °C for purification. Fed-batch cultivation for produc- 
tion of the full-length protein was carried out in a 3-1 fermentor 
under conditions as described previously (Nordberg Karlsson et 
al. 3998). Bacteria were grown to OD 60Q =32, induced by the addi- 
tion of IPTG to a final concentration of 0.5 mM, and grown for 
90 min following induction. The fermentation culture was then 
run three times through a Gaulin high-pressure homogenizer 
(AP V-Schroder, Liibeck, Germany) at 700 atm to disrupt the cells. 
The cell extract was centrifuged (27,000x£, 4 °C, 15 min) and the 
supernatant was frozen at -20 °C. 



Table 1 Primers used for 
amplification of the examined 
deletion forms of cell 2a. The 
sequences are shown from 5' to 
3'. The restriction sites used for 
cloning are underlined 



Amplified fragment 



A(SP t L)cell2A 



A(SP)cell2A 



A(CM)cell2A 



Number 



Oligonucleotides 



F: AGGACTC££X2XQACCGTCGAGCTGTCGG 
R: ACCTGA QAAQCTT CTGCACCGTTACOGA 

F: AGCTCCAI^mTGCGACTGGCTCTTTCC 
R: ACCTGAGAAGCHCTGCACCGTTACGGA 

F: GATGTC££X£EGACCGTCATGCGTGCG 
R: ATTAT TCTCGAGC GGCTCGGGCTCACrCr 
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Protein purification 

The crude cell extracts, except the one containing the full-length en- 
zyme, were heat-treated at 65 °C for 30 rain, centrifuged at 27,000x£ 
for 15 min, and the supernatants were loaded onto an immobilised 
metal-ion affinity chromatography (IMAC) column. Recombinant 
proteins were purified utilising a C-teiminal His tag as described 
elsewhere (Abou Hachera et al, 2000). The eluate was dialysed three 
times for several hours against 45 volumes of 20 mM Tris-HCl, 
pH 7.5. Gel filtration chromatography was employed for separation 
of cleaved and uncleaved A(SP)Cell2AH protein. A 50-cm-long col- 
umn with an internal diameter of 17 mm and packed with Sephadex 
G-50 resin (Amersham Pharmacia Biotech, Uppsala, Sweden) was 
used Samples of 1.2 ml were loaded onto the column and eluted 
with 20 mM Tris-HCl. pH 7.5, with a flow rate of 0.8 ml/min. 



Protein characterisation 

Protein production was analysed by SDS-PAGE (Laemmli 1970), 
and the gels were stained with Coornassie brilliant blue G250 
(Merck, Darmstadt, Germany). Enzyme activity was detected by an 
overlayer agarose 1% (w/v) gel containing CMC 0.02% (w/v). The 
SDS/PAGE gel was washed with phosphate buffer, pH 7.0, contain- 
ing 1% Triton X-100 (Merck) for 20 min. Thereafter, the gel was 
washed with phosphate buffer, pH 7.0, covered with the overlayer 
gel, and incubated for 2 h at 65 °C. Subsequently, the agarose gel 
was stained in 1% (w/v) Congo red solution and destained with 1 M 
NaCl. The dinitro salicylic acid (DNS) stopping method (Miller et 
al. 1960) was carried out for enzyme activity measurements. Sam- 
ples were incubated with 0.9% (w/v) CMC in 20 mM Tris-HCl, 
pH 7.5, at 65 °C for 20 rain (alternatively 30 min). The reaction was 
stopped by the addition of 1.5 volume of DNS solution (1% DNS, 
0.2% phenol, 0.05% Na^SOa, 1% NaOH, 20% Na-K/tartrate), and 
samples were subsequently boiled for 15 min. The amount of re- 
leased reducing sugars was estimated by absoibance at 550 nrn in 
96 -well micro titrc plates with glucose as standard Several sub- 
strates were used for enzyme specificity tests: CMC, phosphoric ac- 
id swollen cellulose (PASC) (Wood 1988), birch wood xylan, liche- 
nan, locust bean gum and guar gum (Sigma, St Louis, Mo.), gluco- 
mannan (Konjak) (Megazyme, Bray, Irleand) and Avicel (Merck). 
The temperature optimum was determined by running the activity 
assay at temperatures ranging from 40 to 105 °C in 20 mM Tris- 
HCl, pH 7.5, for 20 min. The pH optimum was studied by running 
the assay in 50 mM citrate-phosphate buffer (pH range 3.0-6.0), 
50 mM sodium phosphate, buffer (pH range 6.0-7.0), 50 mM Tris- 
HCl buffer (pH range 7.0-9.0), and 50 mM glycine-NaOH buffer 
(pH range 9.0-1 0.0) at 65 °C for 30 min. The enzyme concentration 
in the above assays was 0.04 mg/ml. Thermal deactivation was ex- 
amined by determining residual activity after incubating the enzyme 
(0.4 rag/ml) in the absence of substrates at 65 D C, 85 °C and 90 °C 
for different time periods in 20 mM Tris-HCl. buffer, pH 7.5. 



Differential scanning calorimetry 

Protein unfolding temperature (r m ), expressed as temperature with 
apparent maximum excess heat capacity (C p value), was measured 
by differential scanning calorimetry (DSC; using a VP-DSC mi- 
crocalorimeter (MicroCal, Northampton, Mass.) with a scan rate 
of 60 °C/h. Protein concentrations of 0,2 mg/ml were used and 
measurements were performed in 20 mM Tris-HCl, pH 7.5, with 
or without SDS 0,5% (w/v). 



Results 

Expression of Cell2A in E. coli 

Expression of the full-length cellulase (Cell2A) demon- 
strated that induction of recombinant protein synthesis 
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Fig. 2 Growth curves of Escherichia coli strain BL21(DE3) dur- 
ing expression of Cell2A: induced (•) and not induced (O), and 
A(SP,L)Cell2AH-mduced (A) and not induced (A). Induction 
time is marked with an arrow 
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Fig- 3 SDS-PAGE of crude cell content of E. coli expressing 
wild-type and truncated forms of Ccll2A- Cells were harvest- 
ed 2.5 h after induction. 1 Bacteria not induced, 2 Cell2A, 
3 A(SP)Cell2AH, 4 A(SP,L)Cell2AH 



resulted in significant cell lysis, observed as a decrease 
in optical density measured at 600 nm (Fig. 2). Signifi- 
cant changes in bacterial morphology were noted after 
induction. This behaviour suggested that the produced 
protein was toxic to the host, Cellulase production was 
extremely low, and no clear, recombinant protein band 
was visible on the Commassie-stained polyacrylamide 
gel (Fig. 3). An activity-stained overlayer gel showed 
two faint bands (data not shown). The higher molecular 
mass band (30 kDa) corresponded to the full-length en- 
zyme, whereas the lower molecular mass band (28 kDa) 
corresponded to the protein lacking about 20 residues, 
Cell2A was purified by the IMAC chromatography 
without prior heat treatment of crude cell extract. Partial- 
ly purified protein was then heat-treated. Substantial pre- 
cipitation of recombinant protein was observed, resulting 
in a very low yield of pure Cell2A.= Samples incubated 
for different periods had similar activity, judging from 
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Fig. 4 Hydrophobicity {grey 
line) and flexibility (black line) 
calculated for Cell2A, using 
the PROTSCALE tool with a 
window size of 19 residues. 
Both scales are normalised to 1 
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Expression of the N-terminal fragment of Cell2A 

To verify whether the N-terminal hydrophobic part of 
Cell2A was responsible for toxicity, a deletion mutant 
encoding the first 37 amino acids (A(CM)Cell2AH) was 
e and t he lower mo 'pni ^ r mass Iflim were Jjg iriv cloned and expressed. Following induction, there was a 
si milarT Purified protein was not stable at 4 °C, ancfafterf^drop in optical density, and the cells behaved strikingly 

similar to those expressing the full-length enzyme (data 



the activity-stained overlayer gel. This suggests that only 
a minor part of Cell2A was in the active form. Interest- 
ingly, although the major fraction of the recombinant 
protein was expressed in the low molecular mass form, 
the activity bands corresponding to the full-length en- 
zyme and the lower mo ^nil?l r ma « g fnrrT ? wprp fairly 



(^several weeks of storage lost its activity^— 



not shown). 



Sequence analysis of Cel 12 A 

In an effort to explain the expression problems encoun- 
tered, the amino acid sequence of the protein was analy- 
sed. As previously described (Halldorsdottir et al 1998), 
Cell2A displays highest sequence similarity to the endo- 
glucanase Cell2B from 5. lividans 66 (Wittmann et al. 
1994). The catalytic module of Cell2B aligns with that 
of Cell2A downstream a 4x-repeat of a glutamic acid 
and proline doublet, [EP] 4 . The N-terminal part of 
Cell2A does not share similarity with any known do- 
main; however, the amino acid sequence analysis using 
PROSITE tool showed that it resembles an E. coli signal 
peptide (Hayashi and Wu 1990), with a predicted cleav- 
age site between amino acid residues 17 and 18. The 
amino acid profile generated using the optimised match- 
ing hydrophobicity (OMH) scale (Sweet and Eisenberg 
1983) demonstrated that this N-terminal part of Cell 2A 
is the most hydrophobic in the whole molecule (Fig. 4), 
This hydrophobic stretch is followed by a highly polar 
region rich in glutamic acid and proline residues. The 
profile generated using the average flexibility index 
(Bhaskaran and Ponnuswamy 1988) showed that the 
highly rigid N-terminal part is followed by a highly flex- 
ible part encompassing the [EP] 4 region (Fig. 4). This is 
supported by secondary structure prediction, yielding a 
random coil structure for the flexible part (between ami- 
no acid residues 20 and 40). 

These observations suggest that the Cell2A molecule 
consists of a catalytic module and a short hydrophobic 
N-tercrunal fragment connected by a flexible linker. In 
addition, the toxicity of the full-length enzyme in E. coli 
could possibly be attributed to this N-terminal hydropho- 
bic part. 



Expression of Cell2A deletion mutants 

To overcome bacterial host death during expression of 
Cell2A, deletion mutant proteins were designed* The 
nucleotide sequence of the cloned gene fragments encod- 
ing the catalytic ally active truncated forms of the cellula- 
se revealed a single T252S mutation in the primary 
structure of Cel 12 A. During expression of the catalytic 
module (A(SP,L)Cell2AH), the protein was efficiently 
produced and no cell lysis was observed (Fig. 2). The 
growth rate was slightly inhibited because of intensive 
production of recombinant protein (approximately 40% 
of total protein content, Fig. 3). The protein was stable at 
65 °C (data not shown) and displayed activity on CMC. 
The mutant protein (A(SP)Cell2AH) lacking the putative 
signal peptide (first 17 amino acids) was effectively pro- 
duced in active form. However, SDS-PAGE analysis 
(Fig. 3) and activity staining showed that the mutant 
gave rise to two bands on the gels. The apparent molecu- 
lar masses of the bands were 29 and 34 kDa, compared 
to the expected molecular mass of 30.3 kDa. The lower 
molecular mass band could be a result of protease cleav- 
age in the N- terminus, as both forms of the mutant pro- 
tein could be purified on thjralrihity^ 
had^he— Mstidirie tags ^Both A(SP)Cell2AH tmST 
A(SP,L)Cell2AH were stable at 4 ° C for several weeks. 

Characterisation of truncated forms of the Cel 1.2A 

The influence of temperature on activity was investigat- 
ed by incubating the proteins at different temperatures 
ranging from 40 to 100 °C according to the standard 
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Table 2 Some properties of recombinant Cell2A and its truncated CMC Carboxymethyl cellulose, nd not determined, T m unfolding 
forms. One unit corresponds to the amount of enzyme that releases temperature in 20 mM Tris-HCl (pH 7.5), T m * unfolding tempera - 
1 ^mol of reducing sugars/min under described assay conditions, ture in 20 mM Tris-HCl (pH 7.5)+0.5% SDS 



Protein 


T 

■* optlm. 




Specific activity 
on CMC 


t in at 90 °C 


Cell2A 

A(SP,L)Cell2AH 
A(SP,L)Cell2A 
A(SP)Cell2AH 
A(SP)Cell2A 


^100 °c 
90 °C 
nd 

£100 °C ' 
£100 °C 


Aggregation/120.0 °C • 

94.5 °C/106.6 b C 

nd 

102.9 °C/nd . 
103.2 °C/120.0 °C 


0.7 U/mg 
2.7 U/mg 
nd 

3.1 U/mg 
3.1 U/mg 


3h 
2h 
2.5 h 
5h 

80% after 16 h; 45 min at 100 °C 



assay (data not shown). The initial activity of Cell2A 
and A(SP)Cell2AH increased up to 100 °C, but 
A(SP,L)Cell2AH displayed maximal activity at 90 °C 
(Table 2). The truncated forms of cellulase retained 
95-100% of their initial activity after 16 h of incubation 
at 85 °C. The half-life at 90 °C for Cell2A, for 
A(SP,L)Cell2AH, and A(SP)Cell2AH were 3 h, 2 h, and 
5 h, respectively. The 18-amino-acid HSV tag preceding 
the histidine tag in the pET25b vector had, however, a 
negative influence on the thermostability of the investi- 
gated mutants. Expression in the pET23b vector not en- 
coding this tag gave mutant proteins with longer half- 
lives (Table 2). 

The pH optima for the catalytic module and the full- 
length enzyme were similar. Both enzymes were active 
. over a broad pH range and maintained more than 40% of 
maximal activity between pH 4.0 and 8.0 and more than 
20%atpH 10. 

The catalytic module of Cell2A had the same sub- 
strate specificity as the full-length protein, both being 
active on soluble polysaccharides with (3-1— >4 and 
P- 1— »3,l-4 linkages (CMC, lichenan, and glucomannan) 
with the highest activity on lichenan (7.2 U/mg). The 
catalytic module of Cell2A and the full-length enzyme 
expressed very low activity on Avicel and were not ac- 
tive on xylan and galactomannan. The specific activity 
on CMC determined for the truncated forms of cellulase 
was three to four times higher than that of the full-length 
protein (Table 2). 

Moderate concentrations of the onic detergent SDS 
(0.5% w/v) decreased the activity of A(SP,L)Cell2AH 
and A(SP)Cell2AH by 14% and 6%, respectively. How- 
ever, the full-length enzyme showed 30% higher initial 
activity in the presence of &DS. 



DCS studies 

No distinct peak could be observed on the thermogram 
of Cell2A. Considerable aggregation of the protein (ex- 
pressed as fluctuations of the apparent Cp value), start- 
ing at temperatures of 65 °C-70 °C, was observed (data 
not shown) in 20 mM Tris-HCl, pH 7.5. The unfolding 
of the catalytic module A(SP,L)Cell2AH and A(SP)- 
Cell2AH occurred at 95 °C and 103 °C, respectively, 
both processes being irreversible. 



In the case of Cell2A, the presence of 0.5% SDS pre- 
vented aggregation, and a thermal transition was ob- 
served at 120 °C. For the deletion mutants A(SP)- 
Cell2AH and A(SP,L)Cell2AH, the unfolding tempera- 
tures were 120 °C and 106.5 °C, respectively, in the 
presence of SDS. 



Discussion 

In a previous study, the endoglucanase Cell2A from 
R. marinus was cloned and produced in E. coli 
(Halldorsdottir et al. 1998). Although the production 
levels exceeded what could be obtained from cultiva- 
tion of R. marinuSy the yield was substantially lower 
than what the T7/lac promoter based system is expected 
to give. The primary structure analysis of Cell2A gave 
a possible explanation for the low production yield, and 
revealed an interesting organisational feature of this en- 
zyme. The hydrophobicity and other features deduced 
from the primary structure analysis of the first 17 ami- 
no acid residues of the enzyme were typical of signal 
peptides (Creighton 1993; Hayashi and Wu 1990). This 
N-terminal putative signal peptide is connected to the 
catalytic module by a linker sequence typical for do- 
main linkage in glycosyl hydrolases (Tomme et al. 
1995). Despite their structural diversity, linkers are typ- 
ically rich in prolines, hydroxy amino acids, alanine or 
glycines, and the arrangement of these, amino acids 
seems to impart the flexibility necessary for function. 
The 4x-repeat of glutamic acid and proline preceding 
the catalytic domain of Cell2A is reminiscent of the 
linkers that separate the different modules of R. marin- 
us xylanase XynlOA (Nordberg Karlsson et al. 1997). 
Assuming that the enzyme is anchored to the cell via 
the putative signal peptide, it makes good sense -to have 
a flexible arm separating the plausibly rigid anchor 
from the catalytic core. The occurrence of a linker 
sequence between the putative signal peptide and the 
catalytic module of this enzyme is shared with another 
family 12 glycosyl-hydrolase from the hyperthermo- 
philic archaeon R furiosus (Bauer et al. 1999). Unfor- 
tunately, the wild-type R furiosus enzyme was not 
characterised, and it is not possible to draw any conclu- 
sions regarding signal peptide processing or cell attach- 
ment. 
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Our results strongly suggest that the putative signal 
peptide is responsible for the toxicity of the full-length 
enzyme in E. coli. Successful expression with significant 
improvement of production yields could be achieved fol- 
lowing its deletion. Furthermore, results from the expres- 
sion of the first 38 amino acid residues of the protein in 
E. coli support this hypothesis. Similar expression prob- 
lems were reported earlier for bacterial porins and mem- 
brane proteins (Fenno et al. 1996; Miroux and Walker 
1996), but not for the family 12 glycosyl hydrolases. 
A plausible explanation for the toxic behaviour could be 
the processing of the putative signal peptide by E. coli, 
ultimately resulting in interference with the secretory ap- 
paratus of the bacterium (Fenno et al. 1996). The lower 
molecular mass band observed on the activity- stained gel 
of the full-length enzyme might be a result of processing 
of the putative signal peptide by E. coli. It is, however, 
unclear how the enzyme is processed in R. marinus. Al- 
though the wild-type cellulase purified from cultivation 
supernatants (Hreggvidsson et aL 1996) has been sug- 
gested to be the same enzyme as Cell2A (Halldorsdottir 
et al. 1998), the N-terminal sequence of the enzyme has 
never been determined. The recombinant full-length en- 
zyme was not stable in solution, and heating at 65 °C 
caused extensive aggregation at moderately high protein 
concentrations. This raises the question of whether the 
wild-type cellulase is secreted into the medium or if it 
remains cell- associated, with its hydrophobic pan em- 
bedded in the hydrophobic milieu of the cell membrane. 
Observations supporting the latter hypothesis are the 
DSC thermograms and the activity measurement of full- 
length Cell 2A in the presence of SDS. Both show that 
SDS is necessary to prevent aggregation of full-length 
enzyme in solution. In the case of the full-length en- 
zyme, the specific activity was slightly higher in the 
presence of SDS. By contrast, the presence of SDS led to 
a decrease in the specific activity, of both truncated en- 
zymes under the same conditions (data not shown). Cell- 
attached glycosyl hydrolase activities were reported ear- 
lier from R. marinus cultivations, where considerable ac- 
tivity was detected in the cell fraction (Blticher et al. 
2000; Dahlberg et al. 1993). In an effort to assess wheth- 
er this was the case for Cell2A,^ marinus was cultivat- 
ed in the presence of CMC and the endoglucanase activi- 
ty was assayed. The activity in the sonicated cell fraction 
was several-fold higher than in the supernatant fraction 
(data not shown), which appreciably enforces the cell at- 
tachment hypothesis. 

v The deletion mutants of Cell2A did not deviate from 
the full-length enzyme in their pH optima or substrate 
specificity. This is consistent with the fact that the integrity 
of the catalytic core was maintained, and the mutations in- 
troduced did not result in appreciable changes of the theo- 
retically estimated isoelectric point of the enzyme. The ap- 
parent deviation in molecular mass of the mutant protein 
containing the linker region and the catalytic module could 
be explained by the high thermostability of the construct 
or unusual electrophoretic mobility of the linker region. 
The mutation (T252S in Cell2A) revealed by nucleotide 
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sequencing of the truncated forms of Cell2A could be at- 
tributed either to a sequencing error in the original se- 
quencing of the gene, or it could have been introduced 
during PCR amplification. Either way, it is highly unlikely 
that this would have a significant effect on the properties 
of the enzyme or its truncated forms, since it occurs in a 
non-conserved region, which in the structure' of the homol- 
ogous Cell2B from S. lividans is distant from the active 
site. In addition, the change from threonine to serine 
means that the chemical nature of the residue is main- 
tained, as threonine and serine are both polar uncharged 
amino acids. However, deletion of the first 17 amino acid 
residues had a pronounced effect on the stability of the en- 
zyme. The hydrophobic V sidues of ^ P uCative signal 
peptide are likely to be solvent-exposed and consequently 
would comprise the thermodynamic driving force for ag- 
gregation. In this context, heating the full-length protein 
merely accelerates the aggregation process rather than 
causes it, since hydrophobic interactions increase with 
temperature in this region. Keeping this in mind, it is pos- 
sible to reconcile the results from the thermal deactivation 
and temperature optimum. One of the truncated forms of 
Cell2A (A(SP)Cell2AH) exhibited better apparent ther- 
mal stability than the full-length enzyme, as suggested by 
the half-lives at 90 °C. It is likely that the loss of activity 
of the full-length enzyme is a result of aggregation, which 
would render the active site of the enzyme inaccessible to 
its substrate. In addition, the same explanation holds to ra- 
tionalise the apparent increase in specific activity of the 
deletion mutants compared to the full-length enzyme. De- 
spite this apparent increase in specific activity, the catalyt- 
ic core of the studied enzyme still. displays a relatively low 
specific activity at 65 °C, indicative of a rigid structure of 
the enzyme at this temperature. 

The results presented in this work suggest that the en- 
zyme could be cell-associated. Additional studies are, 
however, needed to verify cell attachment and to assess 
the mechanism by. which it is mediated. The maintained 
thermostability, broad pH range activity, and production 
feasibility of the truncated Cell2A forms make them po- 
tentially interesting for industrial biptechnological appli- 
cations. 

Note added in proof During the editing process, the full-length 
Cell2A encoding insert in pET23bAH was resequenced. It has 
been confirmed that the T252S was not a mutation, but rather a se- 
quencing error as suggested in the discussion. 
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